#### PCT

#### WORLD INTELLECTUAL PROPERTY ORGANIZATION International Bureau



# INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification 6: (11) International Publication Number: WO 98/09208 G06F A2 (43) International Publication Date: 5 March 1998 (05.03.98) (72) Inventors: FOGELIN, John, C.; 250 Coleridge Street, San (21) International Application Number: PCT/US97/14842 Francisco, CA 94110 (US). SMITH, Colin; 22, impasse de la Capitainerie, F-56860 Sene (FR). WILNER, David, (22) International Filing Date: 21 August 1997 (21.08.97) N.; 2733 Claremont Boulevard, Berkeley, CA 94705 (US). HARTMAN, John; 4680 Davenport Avenue, Oakland, CA 94619 (US). LONG, Kent, D.; 6109 McBride Avenue, (30) Priority Data: Richmond, CA 94805 (US). STEIMAN-CAMERON, Deb-08/705,881 28 August 1996 (28.08.96) 211 bie; 30 Waterside Circle, Redwood City, CA 94065 (US). SHEPARD, Marc; 4584 Santa Rita Road, Richmond, CA 94803 (US). ZHOU, Yiwen; 2020 Franciscan Way #208, Alameda, CA 94501 (US). WADDINGTON, Simon; 3 Em-(71) Applicant: WIND RIVER SYSTEMS, INC. [US/US]; 1010 Atlantic Avenue, Alameda, CA 94501 (US). barcadero West #341, Oakland, CA 94607 (US). MAISON-NEUVE, Philippe; 1, allée Louis Aubert, F-56000 Vannes (FR). DARLET, Pierre-Alain; 17, square les Bouleaux, F-56610 Arradon (FR). PREYSSLER, Thierry; 2615 Telegraph Avenue #203, Berkeley, CA 94704 (US). ORIOT, Jean-Claude; 3, rue de Ranquin, F-56860 Sene (FR). ents: MENDENHALL, Larry et al.; Townsend and Townsend and Crew LLP, 8th floor, Two Embarcadero Center, San Francisco, CA 94111 (US). (81) Designated States: JP, European patent (AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE). **Published** Without international search report and to be republished

#### (54) Title: A TOOL FOR SOFTWARE DIAGNOSIS DIVIDED ACROSS MULTIPLE PROCESSORS

A target/host computer system wherein traffic on the connecting communications link and the size of the agent in the target are reduced by using Gophers, debugging services on demand, shared run-time libraries, quantized streaming differential caches, host-based target memory management, virtual I/O and in-circuit emulation.

upon receipt of that report.

# FOR THE PURPOSES OF INFORMATION ONLY

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT.

| AL   | Albania                  | ES | Spain               | LS   | Lesotho               | SI   | Slovenia                 |
|------|--------------------------|----|---------------------|------|-----------------------|------|--------------------------|
| AM   | Armenia                  | FI | Finland             | LT   | Lithuania             | SK   | Slovakia                 |
| ΑT   | Austria                  | FR | France              | LU   | Luxembourg            | SN   | Senegal                  |
| AU - | Australia                | GA | Gabon               | LV   | Latvia ·              | SZ   | Swaziland                |
| AZ   | Azerbaijan               | GB | United Kingdom      | MC   | Monaco                | TD   | Chad                     |
| BA   | Bosnia and Herzegovina   | GE | Georgia             | MD   | Republic of Moldova   | TG   | Togo                     |
| BB   | Barbados                 | GH | Ghana               | MG   | Madagascar            | TJ   | Tajikistan               |
| BE   | Belgium                  | GN | Guinea              | MK   | The former Yugoslav   | TM   | Turkmenistan             |
| BF   | Burkina Faso             | GR | Greece              |      | Republic of Macedonia | TR   | Turkey                   |
| BG   | Bulgaria                 | HU | Hungary             | ML   | Mali                  | TT   | Trinidad and Tobago      |
| BJ   | Benin                    | IE | Ireland             | MN   | Mongolia              | · UA | Ukraine                  |
| BR   | Brazil                   | IL | Israel              | MR   | Mauritania            | UG   | Uganda                   |
| BY   | Belarus                  | IS | Iceland             | MW   | Malawi                | US   | United States of America |
| CA   | Canada                   | IT | Italy               | MX   | Mexico                | UZ   | Uzbekistan               |
| CF   | Central African Republic | JР | Japan               | NE ' | Niger                 | VN   | Viet Nam                 |
| - CG | Congo                    | KE | Kenya               | NL   | Netherlands           | YU   | Yugoslavia               |
| СН   | Switzerland '            | KG | Kyrgyzstan          | NO   | Norway                | zw   | Zimbabwe                 |
| CI   | Côte d'Ivoire            | KP | Democratic People's | NZ   | New Zealand           |      |                          |
| CM   | Cameroon                 |    | Republic of Korea   | PL   | Poland                |      |                          |
| CN   | China                    | KR | Republic of Korea   | PT   | Portuga!              |      |                          |
| CU   | Cuba                     | KZ | Kazakstan           | RO   | Romania               |      |                          |
| CZ   | Czech Republic           | LC | Saint Lucia         | RU   | Russian Federation    |      |                          |
| DE   | Germany                  | LI | Liechtenstein       | SD   | Sudan                 |      |                          |
| DK   | Denmark                  | LK | Sri Lanka           | SE   | Sweden                |      |                          |
| EE   | Estonia                  | LR | Liberia             | SG   | Singapore             |      |                          |
| •    |                          |    |                     |      | •                     |      | •                        |

15

20

25

30

35

# A TOOL FOR SOFTWARE DIAGNOSIS DIVIDED ACROSS MULTIPLE PROCESSORS

#### BACKGROUND OF THE INVENTION

This application claims priority of U.S. Provisional
Patent Application No. 60/005,110, filed September 1, 1995,
entitled "A Tool for Software Diagnosis Divided Across
Multiple Processors," with Attorney Docket No. 014274-000200.

This invention relates to software development and debugging environments. In particular, the invention relates to development and debugging environments wherein the software to be analyzed resides in a processor system distinct from but connected to another processor system running the debugging software's interface with the user.

In the development of software, the use of a debugging program (or "debugger") is not uncommon to analyze software to be developed or known to have a bug. This is the test software. One of the primary functions of such a debugger is to allow the user -- typically a development or diagnostics engineer -- to view the memory as modified by the execution of the test software.

As C and C++ are the programming languages currently enjoying the greatest acceptance among programmers in the art, this invention is described in C and C++. Of course, any programming language capable of the concepts discussed herein can realize this invention and its benefits.

The problems of debugging software are amplified in certain respects when the processor system on which the test software resides (the "target system" or "target") has insufficient resources to support a debugging environment. The usual critical resource is memory. In these situations, a substantial portion of the debugging environment can be displaced to a separate remote system, the "host system" or "host," and the target and host systems communicate by a

10

15

25

30

35

40

communications link. The software on the target communicating over the link is the debugging agent, and that portion residing on the host is the debugging server.

The resolution of the limited memory problem by displacement results in other problems: communicating using the limited bandwidth of the communications link, for example. The bandwidth limitation is particularly acute when mass or high-speed data throughput is not the primary function of what becomes the communications link for development or debugging purposes or when complex and/or large data structures such as linked lists or tables must be retrieved by the debugging server from the debugging agent. In prior art systems, a significant portion of the bandwidth expended on retrieval of data from the target is simply the transmission of data requests from the server to the agent. For example, the retrieval of the linked list linked\_data\_list of the following data structure

structure datum

{
 int \* first\_element;
 int second\_element;
 datum \*datum\_pointer;
} \*linked\_data\_list;

requires first requesting and retrieving the address of the datum structure at the head of the list (\*linked\_data\_list), then requesting and retrieving sizeof(structure datum) bytes from that address, then requesting and retrieving sizeof(structure datum) bytes from \*(linked\_data\_list->datum\_pointer), and so on. Any reduction in the amount of data transferred as data requests or the amount of data transferred in response to data requests would be a relief from the constraints of the narrow-channel communications link.

Another prior art method for retrieving linked\_data\_list would be to have a special service, GET\_linked\_data\_list, built into the debugging agent. Thus, when the debugging server wants to look at linked\_data\_list, it simply invokes the GET\_linked\_data\_list service. However,

given the variety of variable types typically found in an operating system and the additional number in the attendant application code, such a scheme does not bode well for minimizing the footprint of the debugging agent on the target memory. The alternative of restricting the freedom of the programmers in declaring variable types is not attractive either.

Even with displacement, the debugging agent can still consume a not insignificant amount of the memory of the target computer. An enduring design struggle is to provide sufficient services in the debugging agent to create a useful and even capability-laden agent while attempting to keep memory consumption to a minimum. A part of this struggle is whether to include services which are not frequently used but are extremely helpful if present when needed.

The uploading of data to the host computer can consume a major percentage of the available bandwidth on the communications link. Typically, the debugging server lacks the intelligence to ask whether the data being transferred is duplicative or otherwise unnecessary. Accordingly, there is a need for a debugging server which screens requests for data of the target and thereby reduces the amount of traffic on the communications link. There is also a need for a debugging server to screen data destined for the target and thereby reduce the amount of traffic on the communications link.

Many device-controlling systems provide only the communications interfaces necessary to control the device or devices of interest. One justification for so limiting the interfaces is cost. However, in a target/host development configuration as described above, at least one communications interface is given over to supporting the target/host configuration. Accordingly, the reallocation of communications interfaces disturbs the system under test, and the test scenario loses a certain amount of verisimilitude or verifiability. It would be preferable to connect all the controlled devices as well as the host system to the target machine — even in configurations where each of N controlled

10

15

20

25

30

35

devices requires a respective one of N total communications interfaces.

Some systems do not incorporate any means for communicating with the external world during normal operation. Nonetheless, a developer would like to debug in situ these and other similarly situated systems (e.g., systems where interrupting (any of) the fully allocated communications interface(s) is not preferable). A technique to enable the developer to debug these severely resource-limited systems is needed.

Prior art exists to establish a communcation path to a resource-constrained system as described. These devices include the NetRom product available from Applied Microsystems Corporation of Redmond, Washington. Typical of its class, the NetRom product connects into a ROM socket on the target side and provides a standard ethernet or serial interface to the host. However, to knowledge, such devices have been used primarily to replace programmed ROMs in unstable systems. Reprogramming the software available via a NetROM is easier than reprogramming an actual ROM.

Accordingly, an object of this invention is to create a debugging environment which has a smaller footprint in the memory of target systems.

Another object of the invention is to minimize the size of the debugging agent in the target memory while still offering all of the capabilities a software developer expects from a full-featured debugging environment.

An object of the invention is the addition of intelligence to the host-side software in order to reduce the consumption of available bandwidth by reducing both the number and size of target/host transactions.

Another goal of the invention is the addition of intelligence to the host-side software in order to obviate certain target-side intelligence and thus free the memory the target-side intelligence would consume.

Yet another goal of the invention is, in a target/host development environment, to connect all controlled devices as well as the host system to the target machine,

particularly when each of N controlled devices requires a respective one of only N available communications interfaces.

Another object of the invention is, in a target/host development environment, to connect to the host a target machine with no conventional communications interfaces available.

Still other objects and goals of the present invention will be obvious to a person of ordinary skill in the art on reading the background above and the disclosure below.

10

. 15

20

25

30

35

5

# SUMMARY OF THE INVENTION

The present invention is apparatus and methods coupling a first, target processor to a second, host processor. The target processor runs a debugging agent while the host processor runs a debugging server application which interacts with its counterpart, the debugging agent. The two processors are coupled by a communications link.

In one embodiment, to retrieve data from the memory of the target processor to the memory of the host, the host processor generates a program in an interpreted language to read the data from the memory of the target processor and communicates that program to the target processor. The target processor interprets the program, reading the data and communicating it over the communications link to the host processor. The program and/or the data communicated can be compressed for transmission and decompressed on reception. The advantages of the apparatus and method include reduced traffic on the limited bandwidth of the communications link and the avoidance of dedicated target-based services of limited use.

In another embodiment, the application to be developed and the debugging agent share libraries and provide debugging services from the shared libraries. Application and debugging code are partially or fully linked together. User demands are satisfied by executing code in the core common to the application and the debugging agent. In a preferred embodiment, a copy of the core is placed on the target machine while a copy of the symbol table is placed on the host. The

of the debugging agent on target memory;

5

10

15

20

25

30

35

approaches.

debugging agent is then scalable beyond the services in the core by default. The amount of redundancy between the debugging agent and the application is at the discretion of the application developer. The developer controls the impact

6

In another embodiment, in order to debug applications, the program text of the debugging agent is supplemented to provide debugging services as the user demands them. Scalable debugging agents fundamentally extend the range of scalability beyond that of traditional target/monitor

In one embodiment, the debugging target and server cooperate to read memory of the target computer into the memory of the host computer coupled to a user input device. When a user (e.g., a user program displaying a thread of execution by continually displaying the relevant stack) issues first and second commands to read the memory of the target computer, the host computer determines the time differential between the sequential commands and reads the memory in response to the second command only if the time differential is greater than a predetermined value.

In another embodiment, the host computer is again coupled to a user input device. When a user directs a modification of the memory of the target computer, the host computer generates a script for moving data extant in the memory of the target computer and/or copying over data to the memory of the target computer so as to recreate the modification specified by the user. Where the size of the modification is small compared to the size of the total data, communicating the changes rather than the total modified data conserves the limited bandwidth of the communication means coupling the two computers together.

In yet another embodiment, the target processor is adapted for coupling to and controlling an external device. The target processor has a number of communications channels needed to operate in its final configuration, including one which controls the external device. The host processor uses this one communications channel for coupling to the target

15

device and uses one of its communications channels to couple the target processor to the external device.

In a further embodiment, the invention couples the host computer and the target processor at the debugging target/server level by means of an in-circuit emulator. One such in-circuit emulator is a ROM emulator.

# BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 illustrated a cross-platform environment according to a preferred embodiment;

Figure 2 illustrates a host/target system with a host-resident cache;

Figure 3 is an abstract diagram of a host/target system and their respective memories;

Figure 4 illustrates the levels of software supporting an embodiment of a virtual I/O channel;

Figure 5 illustrates a target and host connected by remote communications; and

Figure 6 illustrates the architecture of the target debugging server.

# DESCRIPTION OF THE PREFERRED EMBODIMENT

| 25  | TABLE OF | CONTENTS                                 |   | • |   |     |     |
|-----|----------|------------------------------------------|---|---|---|-----|-----|
|     | 1.       | Overview                                 |   |   |   |     |     |
|     | 2.       | Gophers                                  |   | • | • | •   |     |
| 30  | 3.       | Debugging Services on Demand             | • | • | • | • . | 1   |
|     | .4 .     | Shared Run-Time Debugging Services       |   | • | • | •   | 20  |
| 3.5 | 5.       | Quantized Streaming Differential Cache . | _ | • | • | •   | 2:  |
|     | 6.       | Host-Based Target Memory Management      |   |   | • | •   | 24  |
|     | 7.       | Virtual I/O                              |   | • | • | •   | 25  |
| 10  | 8.       | In-Circuit Emulation                     |   | • | • |     | 27  |
|     | . 9.     | World-Wide Access                        | • | • | • | •   | 28  |
| 5   | WHAT     | IS CLAIMED IS                            | • | • | • |     | 30  |
|     | 1. Over  | view                                     |   | - | , | -   | - 0 |

The processor system most likely to benefit from the invention disclosed herein is the cross-platform for software development or software debugging. Figure 1 illustrates a 8

cross-platform environment 100 incorporating the invention. In such a cross-platform environment, the debugging software is divided in two. The first portion of the debugging software, the debugging agent 110, resides on a target computer 120 along with the software 130 undergoing development or debugging. The second portion of the debugging software, the debugging server 140, resides on a host computer 150 which provides an interface (here Tornado™ Launcher 160) to the developing or debugging user. The target and host computers are connected by means of a communications link 170 which can be serial, parallel, LAN, WAN, or any other form of I/O or hardware-assisted emulator or romulator.

The preferred operating system for the target is the VxWorks™ operating system 130b, available from the assignee of this patent application. Communications between the host and target are by means of the ONC RPC protocol, which can be implemented on virtually any transport layer. The host and target respectively support the following services (described from the host's vantage):

20

30

. 35

. 15

5

- •ping a target
- ·connect to the debugging agent
- ·disconnect from the debugging agent
  - ·set the debugging agent mode
- •read a target memory region
  - ·write a memory block to the target
  - ·fill a target memory block with a pattern
  - •move a target memory block
  - compute the checksum value of a target memory
  - •turn protection on or off for a target memory
    region
  - ·update the target instruction cache
  - ·search for a pattern in the target memory
  - ·create a context on the target
  - ·destroy a context on the target
  - ·suspend a context on the target
  - ·resume a context suspended on the target

- •get values from target registers
- ·set values of target registers
- •write data to a target virtual I/O channel
- •add an eventpoint on the target
- ·delete an eventpoint on the target
- ·get an event from the target
- continue a context suspended on the target
- ·single step a context on the target
- ·call a function on the target
- ocall a function in the agent
  - ·evaluate a gopher string on the target
  - ·check if an event is pending on the target
  - ·clear an event pending on the target
  - ·call a service on the target
- •free the memory used by the returned structure

  Each of these services is described in the Tornado™ API Guide,
  available from the assignee of the instant patent application.

#### 2. Gophers

In order to reduce traffic on the communications link between the target and host, a system uses a first embodiment of the invention: Gophers. Gophers are interpreted programs for efficiently retrieving data from the memory of the target to the memory of the host. Gophers are particularly useful when the data to be retrieved has a complex structure.

In a preferred embodiment, the Gopher language has only one state indicator, a programmable variable referred to as the "pointer." Also, the Gopher language has only one I/O device, an abstract "tape." All Gopher actions are interpreted relative to the value of the pointer, and all output is via the tape.

The Gopher language also includes the following syntactic elements:

Table I:

30

| Directive        | Action                                                                                                                                                                                                                                                   |
|------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| n                | Set the pointer to the integer value n                                                                                                                                                                                                                   |
| [+-]n            | Increment or decrement the pointer by the integer value n                                                                                                                                                                                                |
| 1                | Write the pointer to the tape                                                                                                                                                                                                                            |
| *                | Replace the pointer with the value pointed to by the pointer                                                                                                                                                                                             |
| @[bw1]           | Write the value pointed to by the pointer to the tape, and advance the pointer. "B," "W," and "L" indicate that the size of the value is 8, 16 or 32 bits, respectively. The pointer is advanced by the size of the value written. ("L" is the default.) |
| \$               | Write the (zero-terminated) string to which the pointer points to the tape.                                                                                                                                                                              |
| <b>{ • • • }</b> | Create and use a local copy of the pointer to repeatedly evaluate the Gopher string between the curly braces until that pointer is zero. Upon completion, the local copy of the pointer is destroyed and the original is unchanged.                      |
| < >              | Create and use a local copy of the pointer to evaluate the Gopher string between the angle braces once. Upon completion, the local copy of the pointer is destroyed and the original is unchanged.                                                       |
| (n )             | Create and use a local copy of the pointer and evaluate the Gopher string inside the parentheses n times.                                                                                                                                                |

A Gopher can be used to retrieve, for example, the linked list ifnet containing most of the information for network interfaces in the preferred target operating system.

The structure ifnet is declared as follows:

10

```
struct ifnet {
         char*if_name;
                            /* name, e.g. ``en'' or ``lo'' */
         shortif unit;
                            /* sub-unit for lower level driver */
         shortif mtu;
                            /* maximum transmission unit */
  5
         shortif flags;
                            /* up/down, broadcast, etc. */
         shortif timer;
                            /* time 'til if_watchdog called */
         int if metric;
                            /* routing metric (external only) */
         structifadder *if_addrlist; /* linked list of addresses per
                                       if */
       /* Some members of this structure have been omitted here for
 10
     simplicity. */
       /* generic interface statistics */
         int if ipackets;
15
                                 /* packets received on interface */
                                 /* input errors on interface */
       # int if errors;
                                /* packets sent on interface */
/* output errors on interface */
        int if opackets;
        int if oerrors;
        int if collisions;
                                /* collisions on cama interfaces */
      /* end statistics */
20
        structifnet *if next;
        *ifnet ptr;
                 An algorithm to traverse the ifnet list might appear
25
      as follows:
           p = ifnet;
           while (p != NULL)
                print p;
30
                p = p->if next;
           }
```

and could be implemented at the host-target interface as a target-implemented service or by constructing a command file or application which repeatedly uses the target memory read service. The equivalent Gopher, however, for the algorithm is:

ifnet { ! + 84 \* }

40

45

This Gopher program is deconstructed as follows: ifnet is the integer address of the ifnet pointer, which can be obtained from the symbol table by means well-known in the art or by querying the debugging server 140 regarding symbol table 180. <ifnet> sets the value of the Gopher pointer. The left brace creates a local copy of the pointer and starts a Gopher loop. The Gopher interpreter will execute the subprogram "! + 84 \*" until the local copy of the pointer is

20

25

30

35

not NULL (zero) when the right curly brace is reached. The syntactic element! writes the value of the (local) pointer to the output tape, and "+84" then increments the (local) pointer. The "\*" expression causes the (local) pointer to be replaced by the quantity to which the (local) pointer currently points. A link is thus followed. The "}" delimits the end of the loop, which is terminated when the address to which the (local) pointer currently points is NULL.

In all Gopher expressions using matching delimiters

(i.e., "{ . . . }," "< . . . >" and "( . . . )") the Gopher
evaluator saves unchanged the pointer in effect outside the
delimiters and creates and uses another pointer for use during
the execution of the delimited expression. The evaluator
restores the outside pointer when the closing delimiter is
executed. While no code follows the closing delimiter in this
example, there may, of course, be occasions when such code
will exist.

Also, all writes are terminated by white space. This allows the contents of the tape to be processed by a subsequent filter having no additional knowledge of the internal characteristics of the data on the tape.

Note that Gophers have no symbolic capacity and do not understand the layout of structures. The "+84" expression was determined from inspecting the size of the elements constituting the ifnet data structure. This is a cost of the language independence of Gophers. Advantages include minimal syntactic elements and compactness.

The above example returns a list of network interface descriptors. A more sophisticated Gopher can extract desired fields from each of those data structures.

The fields of interest in the ifnet data structure are deemed to be the name of the interface, the sub-unit for the lower-level driver, the maximum transmission unit, the flags, the routing metric and the five packet-statistics, respectively, the if\_name, if\_unit, if\_mtu, if\_flags, if\_metric and if\_ipackets through if\_collisions members.

With the Gopher's pointer pointing to the first byte of an ifnet data structure, it also points to the address of

the string naming the interface. A Gopher program string to dereference the pointer and write the string addressed follows:

5 <\*\$>

10

15

30

The left angle bracket introduces a subcontext which saves the value of the outer pointer. After execution of the ">," the pointer still points to the top of the data structure, despite the dereferencing and advancement of the local pointer caused by the pointer-replacement and string-write actions.

To fetch the next four fields of interest, with the pointer still pointing to the if\_name field, the following Gopher program string advances the pointer over the four bytes of the if\_name pointer ("+4") and then writes the value pointed to by the now-advanced pointer ("@"):

#### +4 @w @w @w +2 @

Because if\_unit, if\_mtu and if\_flags are shorts, the value write actions are modified with "w." The Gopher program string "+2 @" advances the pointer over the watchdog timer element and writes the external metric, a four-byte quantity.

The packet statistics are consecutive within the
data structure and the following program string suffices to
retrieve them:

+48 @@@@@

Thus, the entire Gopher program to retrieve the ten fields of interest is:

# <\*\$> +4 @w @w @w +2 @ +48 @@@@@

Assuming one byte per character in the Gopher

language, the above program consumes only 32 bytes -- 24 bytes
if the unnecessary white spaces inserted for human form
factors are removed. Only one remote procedure call to
execute this 24-byte Gopher program is necessary to retrieve

the desired data, and only the 30+ bytes actually desired are transferred.

By comparison, in order to limit the number of RPCs, the prior art can read large blocks of data of, say, sizeof(struct ifnet) bytes or more. Eighty-four bytes are transferred when only 30 bytes are desired, and some extraneous number of bytes from the large blocks will be transferred as well. Also, the number of RPCs increases as large blocks are read as necessary to retrieve the NULL-terminated name of the interface, and the number of extraneous bytes transferred also increases.

On the other hand, in order to limit the number of bytes of data transferred in response to requests, the prior art can execute a series of RPCs, one for each non-contiguous block of ifnet data structure members desired.

The superiority of Gophers is highlighted when the retrieval of the above 30+ bytes from each of a number of linked ifnet structures is considered. For N such structures, the prior art RPCs increase by a factor of N, and the extraneous data transferred increases by the same factor. In starkest contrast, the Gopher program increases by a mere one byte from

ifnet < <\*\$> +4 @w @w ew +2 @ +48 @@@@@ >

25

30

35

5

10

15

20

to

ifnet { <\*\$> +4 @w @w +2 @ +48 @@@@@ \* }

Further, with Gophers, the number of RPCs remains at the lowest number possible: one. The significance of this cannot be overstated. As a person of ordinary skill in the art will appreciate, different levels of the communications standard supporting the remote procedure calls (e.g., the International Standards Organization ("ISO") Open System Interconnection Reference Model ("OSI")) impose overhead, and the bandwidth which the cumulative overhead consumes can be

15

truly large. An ability to keep the number of RPCs to the absolute minimum is necessary to minimize bandwidth consumption.

5

10

15

20

25

30

35

The prior art can minimize the number of RPCs (and thus the bandwidth consumed in sending data requests), or it can minimize the amount of data requested (and thus the bandwidth consumed in sending data back), but the prior art does not minimize both the number of RPCs and the amount of data transferred in response to those RPCs. With its' ability to not merely reduce but truly minimize the consumption of bandwidth, Gophers represent a clear and major improvement over the known art.

The debugging agent includes an interpreter for the Gopher language. Execution time of the interpreted Gopher is greater than that of a compiled and linked Gopher would be but still represents only a very small fraction of the overall transfer time. The Gopher interpreter itself is very small. In fact, the Gopher interpreter available from the assignee of this patent application is smaller than the amount of the memory that would have been required by target routines to gather the data of only the basic system objects.

A compression-decompression scheme can reduce still further the bandwidth which the transfer of a Gopher program or of the data in response to the Gopher program consumes. If, for example, the linked list in interest was a list of buffers known to be mostly zeroes, a compression-decompression scheme initiated either by the server or the agent and communicated to the other would still further reduce the load on the communications link.

The Gopher language is compact to reduce communication overhead and is accessible at higher level interfaces within the development environment.

In one embodiment, some or all Gophers are pregenerated based on the information available from the symbol table regarding the data items of interest. Thus, when a user directs the retrieval of a particular data structure, the server need only recall and transmit the pre-generated Gopher produced in anticipation of that direction.

10

15

20

25

### 3. <u>Debugging Services on Demand</u>

With the debugging environment divided between the agent on the target and the server on the host computer, still further reductions in the size of the debug environment in the target memory are nonetheless desirable. Accordingly, this invention provides a scalable debugging agent.

The debugging server 140 has the capability of incrementally loading object modules into the target system 120. To achieve this end, the debugging server 140 uses the symbol tables contained in every object module to build a system-wide symbol table of functions and variable names loaded in the target system 120. The debugging server 140 adds names from system and application modules 130 alike to the system symbol table. Indeed, the debugging server adds names from the debugging agent 110 itself to the system symbol table.

Fig. 6 illustrates the architecture of the target debugging server 140. As shown in Fig. 6, the debugging server 140 maintains a copy 610 of the debugging agent 110 on the host (or, at least, a copy of the symbol table of the agent). This copy 610 allows the debugging server 140 to read and analyze the agent 110's core file to determine what symbols are currently available (resolved) in the agent core. Without a single upload, the debugging server 140 is "bootstrapped" with the core information. The debugging server 140 can immediately execute user commands entered through a shell 160 and can perform other functions — constrained only by what services have been linked into the core.

Correspondingly, the host software 160 will typically include a shell 160d or 160e (with underlying hardware, not shown) or other mechanism for communicating with the user. The host accepts commands on this user input device from the user. (A device for user input is most typically a keyboard or mouse. However, a disk, tape or other medium file containing commands previously stored by the user can also be a device for user input.)

10

15

20

25

30

35

Some of these commands the software 160 can satisfy directly; others may require supplemental information or service from the debugging server 140. Still other commands may require information or services which only the agent 110 can provide. For these commands requiring information or services from the agent 110, the debugging server 140 breaks the user command (as relayed by the software 160) into the necessary transactions with the debugging agent 110.

Before the debugging server 140 requests a service of the debugging agent 110, the debugging server 140 checks the host-resident copy 610 of the debugging agent core to determine whether the symbol(s) necessary to provide the service are resolved in the debugging agent 110.

If the symbols which provide the desired functionality are resolved, then the debugging server 140 can simply request (e.g., via RPC) that the debugging agent 110 provide the service. Where the symbols providing the requested service are not resolved in the agent 110, the debugging server 140 reads a memory device (not shown) to retrieve the object module containing the text, data, bss or literal sections (depending on the unresolved symbols present and the format of the object module) to which the unresolved symbols refer.

In one embodiment, the host 140 then relinks the debugging agent image on the host 150 to include the new sections, resolving the symbols of interest, and produces a list of differences between the old agent image and the new agent image. Using the differences list, the debugging server 140 then directs the debugging agent 110 through a series of target memory modifications (memory moves, writes, etc.) to reproduce in the target memory the re-linked debugging agent core on the host.

In an alternative embodiment, after reading the memory device to retrieve the sections to which the unresolved symbols refer, the debugging server 140 directs the debugging agent 110 to accept the sections and relink itself. This second embodiment is less preferable in that it requires a

WO 98/09208

- 5

10

15

20

25

30

35

larger debugging agent 110 with sufficient intelligence to relink itself.

The result of both of the foregoing embodiments is a debugging agent 110 with the symbols supporting the requested service resolved. The debugging server 140 then directs the debugging agent 110 to perform the desired service, thereby providing the information or service which the host software 160 requires and performing, directly or indirectly, the user's command.

In a preferred embodiment, the debugging server can direct the agent to discard the newly loaded code after use.

A debugging agent that is scalable allows for an agent equipped to handle one environment, say, a production environment, to be extended when necessary to support another environment, say, a development environment or an in-the-field debugging environment. A scalable debugging agent is equipped initially with an optimal subset of the set of all services which it could offer on a target of substantially unlimited resources, and the subset is extendable on demand to offer any of the full set of services. In a preferred embodiment, scaling of the agent is automatic, occurring in a manner transparent to the user.

The primary services which a scalable debugging agent 110 offers are target memory write and target service calls. These primary initial services are sufficient to support the corresponding server 140 on the host.

As mentioned above, a program 160 with a user interface for debugging the target software resides on the host. In the preferred embodiment, this program 160 is the remote source-level CrossWind™ debugger 160d available from the assignee of this patent application. The CrossWind™ debugger 160d extends the GNU Source-Level Debugger (GDB) with a graphical user interface and with a comprehensive Tcl scripting interface which allows developers to create sophisticated debugger macros and CrossWind™ extensions. The CrossWind™ debugger 160d permits a user to spawn and debug tasks on the target. The user can also debug already-running

19

tasks spawned with a runtime shell. Debugging includes setting system and task-level breakpoints.

5

10

15

20

25

30

35

The memory device to which the host debugger 160d refers for the original debugging agent core file or for the additional sections can be a conventional host-based RAM, ROM, disk, user input device or other host-based memory device. Alternatively, the host debugger program 160d can query the agent 110 itself for this information. (The agent 110 is typically loaded from ROM.) Accordingly, the agent 110 contains the origins of itself and its scaling object modules. Querying the agent 110 allows use of agents 110 other than those provided by the assignee of the patent application because no assumptions are made about the construction of an agent 110. The host explicitly asks the agent 110 for information as necessary.

As a precautionary measure, the host may request the agent 110 to provide a checksum of its text region. The host then compares this checksum against a checksum of the text region of the core file specified by the agent 110. One reason that the two checksums might differ is that the core file on the host may have been updated independently of and after the writing of the agent to the target ROM.

The types of services which can be thus incrementally loaded are limited only by the services which must initially reside in the target in order to enable the loader and user interface in the server. The following agent services are good candidates for dynamically scaling into the agent 110: Serial line testing, breakpoint support, context (task) control, event support, host exception notification, external context function calls, gophers, memory services, (other than write and move) register access and virtual I/O support.

Of course, if dynamic scaling as described above is applied to even the most mundane agent services, the size of the debugging agent 110 in the target memory is guaranteed to always be at a minimum. On the user's first invocation of a particular command, the debugging server 140 can download the supporting code -- transparently to the user -- to the agent

WO 98/09208

No (or minimal) services will exist in the agent 110 which the user did not actively need and thus no (or minimal) excess code will exist in the target memory. The debugging agent 110 is essentially customized for the particular debugging session.

In order to avoid the target 110's reserving the maximum amount of memory a scalable agent may ever need just in case the agent grows to that size, the invention includes a method for sharing memory between the debugging agent and the operating system software or other application of the target. In a less complicated form, the debugging agent is allowed to grow, consuming more and more memory. In a more sophisticated form, the debugging agent both grows and shrinks, consuming and releasing memory in the process.

15

20

30

35

10

5

#### Shared Run-Time Debugging Services 4.

The invention includes another method for reducing the size of the debugging agent 110 in the memory of the target computer 120. Typically, application code 130 relies upon a library of functions provided with the application development environment. Among other things, these standard libraries obviate the need for the program developer to program and re-program commonly used routines, and they can make the interfaces to operating system services more programmer-friendly.

25

The conventional wisdom is that a debugging agent (for that matter, debugging tools in general) and the program 130 to be debugged should not share libraries. may be an interaction between the agent 110 and the program 130 which creates or, more subtly, hides a problem. these standard libraries can be very large and duplicating them for both the agent 110 and the application 130 makes the amount of target memory occupied even greater. Accordingly, sharing standard libraries between agent and application can offer a material benefit in conserving target memory.

In one embodiment, this invention provides a simple mechanism to determine if code is to be shared or not. incrementally adding new application code to the target system

10

15

20

30

35

120, the host-based loader 620 merely references the most recently defined symbol for the purpose of symbol resolution. Duplicate entries in the system symbol table are not rejected. In this way, at his discretion, a user may download redundant copies of services provided by the agent in the core image or may link against the resident services.

Such an approach need not be limited to the entirety of a library. Even the sharing of a single routine within a resident library can be avoided. Consider the situation where both the agent and an application need functions f() and g() which exist in a library which could be shared. application can be linked such that function f() comes from the library shared with the agent. Function g() can be supplied in the application's source code, and incrementally downloaded to the system before the remaining body of application code is incrementally downloaded. preferential reference of the loader to the more recent definition of the function g() will resolve the reference to the unshared definition of g(). In prior art, such control over the degree of shared runtime code has not been possible.

When the scalability of the debugging agent 110 is combined with shared libraries, the advantages are geometric.

#### 5. Quantized Streaming Differential Cache

25 In responding to user requests, the debugging server classically turns to the debugging agent, asking for data to be uploaded. The debugging server then uses the uploaded data to produce a response to the user. If the user repeatedly requests the same data, a conventional server will repeatedly request the data of the agent and accordingly prepare the responses to the user.

According to the present invention, the copies of portions of the target memory which have been uploaded and now reside in the memory of the host can be likened to a data cache, with the original data residing on the target computer.

Further, as with all data caches, the question arises: When are reads satisfied from the cache and when must the read be satisfied from the original data? Conventional

25

. 30

35

debugging servers ignore the caching potentials and always satisfy data read requests from the original data on the The proffered justification is the need to have the latest information from the target. However, most of the reads performed to satisfy these requests are not performed in 5 a locking mode wherein the accuracy of the data being read is assured by locking out any code which could modify the data during the reading. Even with reads performed with locking, there is no guarantee that the modifying code will not change the original of the read data after completion of the locked 10 read but before its presentation to the user. There is no guarantee that what the user sees will be the actual state of the target system. In short, conventional debugging servers or agents cannot insure that data presented to the user is accurate and sometimes cannot insure that data presented is 15 internally consistent.

It is preferable to use Gophers performing locking reads to upload data structures associated with system objects. This method insures at least the internal consistency of important objects.

Recognizing that data reads to satisfy user requests may not produce consistent or accurate data, a system incorporating an embodiment of the invention maintains a variable memory register 230 holding a threshold age. Under certain conditions described below, the system avoids consuming bandwidth with data reads.

Figure 2 illustrates a host/target system 220 with a host-resident cache 210. The host software 140 satisfies data requests from the cache 210 if the data in the cache 210 is younger than the threshold age. If the threshold age is some predetermined value, such as zero, the server 140 always satisfies data requests by reference to the agent 110. Preferably, the user chooses the threshold age for the particular situation. Experiential data suggests that a threshold age of one second suffices for day-to-day situations.

In another embodiment, the server maintains individual age thresholds for respective blocks of data in the

host cache. Accordingly, data known or deemed to be volatile can be refreshed at a greater rate than data known or deemed to be more static. In both of these embodiments, the flexibility is in the hands of the user.

Another issue associated with caching is writing back: modifying the original when the copy has been modified. The debugging server cache 210 disclosed herein is "write-through," the data on the target 120 is updated essentially immediately after the copy on the host 150 is modified. The justification here is that the user always intends to modify the data actually on the target. Indeed, the user may not even be aware that the host software is performing any caching.

In certain circumstances, the required writing back can be significantly streamlined. When the original data in the target memory is unchanging (and, therefore, consistent with the host copy), only the writing back of the changed portions of the data is necessary, rather than writing back the entirety of the cached data. Thus, where cache blocks are, say, a kilobyte in size and the change affects only tens or small hundreds of sequential bytes, an advantage lies in instructing the debugging agent 110 to modify the affected tens or hundreds of bytes rather than instructing the agent 110 to re-write the entire kilobyte.

Another example is the reloading of a software application or module. Usually, when an application is changed to remedy some bug, only a small percentage of the application text changes (to add or delete a few lines of code) when re-compiled. Yet in the prior art, when this application is loaded onto the target, the loading consumes bandwidth as necessary to transfer the entire file. However, the invention views the application as a stream of data and the typical recompilation as the insertion and/or deletion of a small portion of the stream. When downloading a new stream of memory to the target 120, the stream is compared with the copy of the original target contents cached on the host 150, and the target 120 is instructed to adjust its memory to slip contents forward or backward to agree with the new module

being constructed. The host 150 then transfers new data, if any, to the target 120 to fill the newly vacant or other memory spaces.

Relocation fields within the stream of data complicate the algorithm because they will almost always differ. However, the list of locations of the fields for both the original and the new stream of data are known beforehand. If the module is re-linked after transfer, these differences are inconsequential.

10

15

20

25

30

. 35

5

# 6. Host-Based Target Memory Management

In some cases, the development tools may require resources from the target in addition to those required by the application software on the target. These additional resources can be, for instance, sophisticated services which require exclusive access to an allocated portion of target memory. Among the most sophisticated of these memory-related services is dynamic memory management.

Figure 3 is an abstract diagram of a host/target system 300 and the respective memories 310 and 320. Systems embodying the invention take into account the fact that the memory 310 of the host system 150 is usually much larger than the memory 320 of the target system 120 and, therefore, the host system 150 can provide storage for many house-keeping data structures used in the management of the target-resident memory 320. With the intelligence (and code space) in the host 310 managing free lists of memory and other cumbersome data structures, the debugging agent 110 need only provide services such as move, copy and fill.

The two data structures typically supporting a memory manager are a free memory list 330 and the allocated list (not shown). (The latter is sometimes excluded for efficiency.) In one embodiment, the debugging server 140 will allocate host-based memory for these data structures and thus realize zero-overhead performance when allocating target memory. The free list 330 is a linked list of free blocks with each link in the list containing the address of the corresponding contiguous free block of memory on the target

and the size of this block. The list 330 is not ordered, and allocation is achieved with a traditional "first-fit" search of the list. The third data structure according to this embodiment is an AVL-tree or binary tree with nodes containing target memory block information sorted within the tree from lower target address to higher target address. This data

25

structure is traversed whenever target memory is deallocated, in order to find the host-resident node that corresponds to the target block to be deallocated. The use of a b-tree limits the worst-case search to log N, where N is the number

In prior art, the target debugging agent manages such data structures. By virtue of this invention, target systems incur less overhead in target memory allocation, thus saving resources for application use. Another virtue of this invention lies in that host-to-target protocol requests related to memory allocation are greatly reduced limiting the intrusion of host tool memory allocation on the real-time execution of a target system application.

20

25

30

35

15

10

#### 7. Virtual I/O

of allocated blocks in the tree.

Consider the scenario wherein the target 110 is a device-controlling processor. It has exactly two communications interfaces, one to receive input from a sensor and the second to control a device based on that input. The operations of the device affect the characteristic read by the sensor. The memory of the target is limited so that a target/host development environment is preferable or required. The question becomes how to connect the host when both communications ports are necessary for operation of the target.

A system incorporating the invention maintains virtual I/O (VIO) channels. A VIO channel can be implemented as a pseudo-tty, a stream as in UNIX®, an underlying device, etc. The host 150 redirects communications intended for the controlled device to a communications channel of the host machine, to which communications channel the controlled device is connected. The target machine is thus in communication

10

30

35

with the sensor, the controlled device and the host computer even though the target machine has only two communications interfaces.

On the target 110, the operating system 130b supports a virtual I/O device driver 190 to provide target applications 130a with I/O system access to the virtual I/O facility according to the standard system call protocols of the target operating system 130b. In a UNIX® - or Posix-like operating system, a first open of a VIO channel is as follows:

The host 150 associates a VIO channel with any one of a wide variety of devices, including displays, files, raw devices, streams, etc. A host application reserves a virtual I/O channel using a system call. As a VIO channel number is used by both the server and agent to determine the source or destination of particular data, the host and target applications must agree as to the channel number. The host application typically then redirects the reserved VIO channel to an actual device on the host, using a system call to control the VIO channel. Following is a code sample redirecting VIO channel 4 to a file:

27

/\* send VIO redirection command of channel 4 to
/tmp/testfile \*/

if (wtxVioCtl (hWtx, 4, VIO\_REDIRECT, fdHost != WTX\_OK)
 return (ERROR);

Any data which the debugging agent 110 reads from or writes to "/vio/4," i.e., VIO channel 4, the target server 140 will redirect from or to "/tmp/testfile."

On the host side, the target server 140 implements the mechanics of VIO channels. The debugging server 140 maintains the VIO channel pool, allocating free VIO channels and freeing VIO channels on request. As illustrated above, the target server 140 also redirects data as requested.

FIG. 4 illustrates some of the levels of software supporting an embodiment of a virtual I/O channel (from the target to the host). The host 400 includes a virtual console 410 in communication with the target server software 460. target server 460 is in communication with the corresponding debugging agent software 420 residing on the target 440. on the target 440 is the virtual I/O device driver 430, which is preferably a part of the operating system residing on the Application software 450 makes the standard target 440. operating system calls (e.g., write() on a UNIX@-like operating system) to the virtual I/O device driver. 4, the virtual I/O device driver 430 instructs the debugging agent to transfer a buffer containing "Hello world\n" to the debugging server, which then directs the display of the string in the Xterm window 450 on the host 400.

#### 8. <u>In-Circuit Emulation</u>

5

10

15

20

25

30

35

40

Consider targets with severely limited hardware resources. Providing a meaningful low cost development environment for this class of targets has historically proven difficult to impossible. For example, consider a target without the critical resource of a communications channel to the host, typically an ethernet or serial communications device. With no means of communication between the target and the host, interacting with the target during development of an embedded application is apparently impossible.

10

15

20

25

30

35

The invention exploits the power of ROM emulators and in-circuit emulators in general in a development environment, connecting a host-based development environment to a target-resident debugging agent in a resource-constrained environment. The advantages of such an approach in so connecting a debugging agent to the host server are two-fold. First, the ROM emulation provides a full-featured debugging environment to those targets in which no such environment could have existed in prior art.

Second, the intermediated communications device now provides the communications and off-loads from the target environment a significant portion of the debugging agent dedicated to communications. As mentioned above, the RAM of the target is often a critical resource. Any technique that reduces the footprint (i.e., storage requirements) on the target of the debugging environment is highly desirable, leaving more target RAM available for application needs. Where a NetROM provides the communications, it reduces the role of the debugging agent and in turn effects considerable savings in target RAM. The local agent program code to manage communications with the host would otherwise occupy the now conserved RAM.

#### 9. World-Wide Access

An added benefit of the target/host configuration upon which the prior art has not capitalized is the realization that the host need not be in physical proximity to the target. Figure 5 illustrates a target 520 and a host 530 connected by means of remote communications technology 510. With the use of a WAN, e.g., the global telephone system with modem interfaces or a satellite communications system, the host 530 and/or target 520 can be anywhere on earth or near space where a port 540 onto the WAN (e.g., a telephone or satellite link) is available. With other known remote communications technology 510, the host 530 and/or target 520 can be anywhere that technology reaches. The communications technology 510 can be a switching network, allowing any host

29

530 with the address (e.g., telephone number) of the target modem to connect with the target 520.

5

10

15

20

25

30

When viewed as a representative of the target on the host, a host-based target server can broker requests for virtually any site. The Internet, with its suite of applications tracking who's who and where they are, is an excellent example of the remote communications technology 510. Its suite of remote services makes possible bringing up a shell, for example, on any target 520 with a known address. Further, a user can register a target server 530 globally, and an authenticated user can bring up any tool on the associated target 520 for the purpose of remote development, demonstration, technical support, etc. This paradigm of distributed World-Wide access to real-time execution platforms shifts the standard usage patterns of cross-development environments from one of single-user work environments to one of distributed development.

In one embodiment, the invention provides on-line access to all registered target systems 520 via their target servers 530 by means of a centralized control panel 160a (Fig. 1) summarizing all available targets and all available tools.

#### CONCLUSION

In using the above-disclosed methods and apparatus, the Assignee of this invention has succeeded in reducing the size of its debugging agents by an order of magnitude over the prior art. Typical agents according to the invention herein are approximately 20 kilobytes (Kb). By comparison, prior art agents are around 330 to 340 Kb. Consequently, the debugging agents disclosed herein are capable of running in a much broader range of target computers, particularly in those with memory less than 300 Kb.

Of course, where memory is not a constraint, the instant invention may be practiced with a single computer system running both the server and the agent.

5

## WHAT IS CLAIMED IS:

| 1        | 1. A method for retrieving data from a first                                             |
|----------|------------------------------------------------------------------------------------------|
| 2        | memory of a first processor, said method comprising:                                     |
| 3        | coupling said first processor to a second processor                                      |
| 4        | by a communications link, said second processor having a                                 |
| . 5      | second memory;                                                                           |
| . 6      | generating on said second processor a program in an                                      |
| <b>7</b> | interpreted language, said program for reading data from                                 |
| 8        | memory and for communicating said read data over said                                    |
| 9        | communications link;                                                                     |
| 10       | communicating said program from said second                                              |
| 11       | processor to said first processor;                                                       |
| 12       | receiving said program on said first processor;                                          |
| 13       | interpreting said program on said first processor,                                       |
| 14       | thereby reading data from said first memory and                                          |
| 15       | communicating said read data from said first processor to                                |
| 16       | said second processor over said communications link; and                                 |
| 17       | receiving said read data on said second processor                                        |
| 18       | and storing said read data in said second memory.                                        |
| 1        | 2. The method of claim 1 whomein gold at an a                                            |
| 2        | <ol><li>The method of claim 1 wherein said step of<br/>communicating comprises</li></ol> |
| 3        | compressing said program and                                                             |
| 4        | said step of receiving said program comprises                                            |
| 5        | decompressing said program.                                                              |
|          | accomplessing said program.                                                              |
| 1        | 3. The method of claim 1 wherein said step of                                            |
| 2        | interpreting comprises                                                                   |
| 3        | compressing said read data and                                                           |
| 4        | said step of receiving said read data comprises                                          |
| 5        | decompressing said read data.                                                            |
| , .      |                                                                                          |
| 1        | 4. An apparatus for debugging an application                                             |
| 2        | program, said apparatus comprising:                                                      |
| 3        | a first processor having a first memory, said memory                                     |
| _        | •                                                                                        |

containing said application program and an interpreter of

a program language;

PCT/US97/14842

| 6  | a user input device through which a user can                       |
|----|--------------------------------------------------------------------|
| 7  | communicate a demand to examine a portion of said first            |
| 8  | memory;                                                            |
| 9  | a communications link; and                                         |
| 10 | a second processor, coupled to said first processor                |
| 11 | by means of said communications link and coupled to said           |
| 12 | user input device, said second processor having a second           |
| 13 | memory containing a program adapted to, in response to             |
| 14 | said demand, generate a program in said interpreted                |
| 15 | program language, said generated program for reading said          |
| 16 | portion of said first memory and for communicating said            |
| 17 | read portion over said communications link to said second          |
| 18 | processor.                                                         |
|    |                                                                    |
| 1  | 5. A method for debugging an application program,                  |
| 2  | said method comprising:                                            |
| 3  | coupling a first processor having a first memory to                |
| 4  | a second processor having a second memory by means of a            |
| 5  | communications link;                                               |
| 6  | coupling said second processor to a user input                     |
| 7  | device on which a user can communicate a demand for a              |
| 8  | debugging service;                                                 |
| 9  | running a debugging program and said application                   |
| 10 | program in said first memory;                                      |
| 11 | modifying, in response to said demand, the program                 |
| 12 | text of said debugging program, enabling said debugging            |
| 13 | program to support said debugging service.                         |
| _  |                                                                    |
| 1  | <ol> <li>A system for debugging an application program,</li> </ol> |
| 2  | said system comprising:                                            |
| 3  | a first processor having a first memory, said memory               |
| 4  | containing said application program and a debugging                |
| 5  | program adapted to perform debugging services;                     |
| 6  | a user input device on which a user communicates a                 |
| 7  | demand for a debugging service; and                                |
| 8  | a second processor, coupled to said first processor                |
| 9  | and coupled to said user input device, said second                 |
| 10 | processor having a second memory, said second memory               |

| 11 | containing a program adapted to interpret said demand and |
|----|-----------------------------------------------------------|
| 12 | to modify the program text of said debugging program to   |
| 13 | enable said debugging program to support said debugging   |
| 14 | service.                                                  |

- 7. The system of claim 6 wherein said program adapted to interpret is further adapted to communicate program code for performing said debugging service to said debugging program and said debugging program is adapted to bind said program code to itself and thereby become capable of performing said debugging service.
- 8. The system of claim 6 wherein said debugging program is further adapted to unbind said program code from itself when said debugging service has not been accessed within a predetermined amount of time.
- 9. A system for debugging an application program, said system comprising:
  - a first processor having a first memory, said first memory containing a symbol table for a core, said core being the linkage of application code for said application program and debugger code for a program for debugging said application program;
  - a second processor, coupled to said first processor, said second processor having a second memory containing a copy of said core uploaded from said first memory; and
  - a user input device, coupled to said first processor, on which a user communicates a demand for a debugging service, said debugging service performed by executing said debugging code in said copy of said core.
  - 10. A method for debugging an application program, said method comprising:
  - coupling a first computer having first memory to a second computer having second memory;
    - linking together application code for said application program with debugger code for debugging said

| 7  | application program, forming a core, said core having a        |
|----|----------------------------------------------------------------|
| 8  | symbol table;                                                  |
| 9  | placing a copy of said core in said first memory and           |
| 10 | a copy of said symbol table in said second memory;             |
| 11 | interpreting a demand for a debugging service input            |
| 12 | on a user input device coupled to said second processor;       |
| 13 | and                                                            |
| 14 | performing said debugging service on said first                |
| 15 | processor by executing said debugger code.                     |
|    |                                                                |
| 1  | 11. The method of claim 10 wherein before said step            |
| 2  | of performing is performed, the following steps are performed: |
| 3  | checking said symbol table to determine whether code           |
| 4  | to perform said debugging service is already resident in       |
| 5  | said first memory;                                             |
| 6  | uploading code to perform said debugging service               |
| 7  | when said checking determines that code to perform said        |
| 8  | debugging service is not already resident in said first        |
| 9  | memory and modifying said symbol table to reflect the          |
| 10 | fact that code to perform said debugging service is            |
| 11 | resident in said first memory; and                             |
| 12 | refraining from uploading code to execute said                 |
| 13 | demand when said checking determines that code to perform      |
| 14 | said debugging service is already resident in said first       |
| 15 | memory.                                                        |
|    |                                                                |
| 1  | 12. A method for reading memory, said method                   |
| 2  | comprising:                                                    |
| 3  | coupling a first computer having first memory to a             |
| 4  | second computer having second memory by means of a             |
| 5  | communications link;                                           |
| .6 | running a program in said first memory, thereby                |
| 7  | modifying said first memory;                                   |
| 8  | receiving a first command at a first time on said              |
| 9  | second computer, the performance of said first command         |
| 10 | requiring the reading of a portion of said first memory;       |
| 11 | reading said portion of said first memory into said            |
| 12 | second memory;                                                 |

| 13  | receiving a second command at a second time on said        |
|-----|------------------------------------------------------------|
| 14  | second computer, the performance of said second command    |
| 15  | requiring the reading of said portion of said first        |
| 16  | memory;                                                    |
| 17  | reading said portion of said first memory into said        |
| 18  | second memory when the difference between said second and  |
| 19  | first times is greater than a predetermined value, and     |
| 20  | refraining from reading said portion of said first memory  |
| 21  | into said second memory when said difference is not        |
| 22  | greater than said predetermined value.                     |
|     |                                                            |
| 1.  | 13. The method of claim 12 wherein said second step        |
| . 2 | of reading further comprises                               |
| 3   | in any event, reading said portion of said first           |
| 4   | memory into said second memory when said predetermined     |
| 5   | value equals a second predetermined value.                 |
|     |                                                            |
| 1   | 14. The method of claim 12 wherein said second step        |
| . 2 | of reading further comprises                               |
| 3   | in any event, reading said portion of said first           |
| 4   | memory into said second memory when said portion of        |
| 5   | memory is associated with a system object.                 |
|     |                                                            |
| Ţ   | 15. The method of claim 12 wherein before said             |
| 2   | second step of reading is performed, the following step is |
| 3   | performed:                                                 |
| 4   | setting said predetermined value in response to a          |
| 5   | command on a user input device.                            |
|     |                                                            |
| 1   | 16. A system for inspecting memory, said system            |
| 2   | comprising:                                                |
| 3   | an inter-processor communications link;                    |
| 4   | a first processor having first memory, a program           |
| 5   | running in said first memory and modifying said first      |
| 6   | memory;                                                    |
| 7   | an input device for receiving commands from a user;        |
| 8   | and and                                                    |

| 9   | a second processor, coupled to said lifet processor      |  |  |  |  |  |
|-----|----------------------------------------------------------|--|--|--|--|--|
| 10  | by means of said inter-processor communications link and |  |  |  |  |  |
| 11  | coupled to said input device, said second processor      |  |  |  |  |  |
| 12  | having                                                   |  |  |  |  |  |
| 13  | second memory;                                           |  |  |  |  |  |
| 14  | a register holding a time quantum;                       |  |  |  |  |  |
| 15  | an arithmetic logic unit; and                            |  |  |  |  |  |
| 16  | a CPU under program control, said program                |  |  |  |  |  |
| 17  | adapted to receive a first command at a first            |  |  |  |  |  |
| 18  | time by means of said input device, the                  |  |  |  |  |  |
| 19  | performance of said first command requiring the          |  |  |  |  |  |
| 20  | reading of a portion of said first memory; to            |  |  |  |  |  |
| 21  | read said portion of said first memory into              |  |  |  |  |  |
| 22  | said second memory; to receive a second command          |  |  |  |  |  |
| 23  | at a second time by means of said input device,          |  |  |  |  |  |
| 24  | the performance of said second command                   |  |  |  |  |  |
| 25  | requiring the reading of said portion of said            |  |  |  |  |  |
| 26  | first memory; to read said portion of said               |  |  |  |  |  |
| 27  | first memory into said second memory when the            |  |  |  |  |  |
| 28  | difference between said second and first times           |  |  |  |  |  |
| 29  | is greater than a predetermined value; and to            |  |  |  |  |  |
| 30  | refrain from reading said portion of said first          |  |  |  |  |  |
| 31  | memory into said second memory when said                 |  |  |  |  |  |
| 32  | difference is not greater than said                      |  |  |  |  |  |
| 33  | predetermined value.                                     |  |  |  |  |  |
| , , |                                                          |  |  |  |  |  |
| 1   | 17. A method for modifying memory, said method           |  |  |  |  |  |
| 2   | comprising:                                              |  |  |  |  |  |
| 3   | coupling a first processor having first memory to a      |  |  |  |  |  |
| 4   | second processor having second memory by means of a      |  |  |  |  |  |
| 5   | communications link;                                     |  |  |  |  |  |
| 6   | downloading a copy of first data from said first         |  |  |  |  |  |
| 7   | memory to said second memory;                            |  |  |  |  |  |
| 8   | generating second data in said second memory;            |  |  |  |  |  |
| 9   | generating a script for modifying said first data to     |  |  |  |  |  |
| 10  | produce said second data;                                |  |  |  |  |  |
| 11  | transmitting said script to said first processor;        |  |  |  |  |  |
|     |                                                          |  |  |  |  |  |

| 12          | executing said script, modifying said first data t                                                        |
|-------------|-----------------------------------------------------------------------------------------------------------|
| <b>13</b> . | produce a copy of said second data in said first memory                                                   |
|             | and it seemsty                                                                                            |
| . 1         | 18. The method of claim 17 wherein said step of                                                           |
| 2           | executing comprises                                                                                       |
| 3           | moving a portion of said first data from a first                                                          |
| 4           | area in said first memory;                                                                                |
| 5           | copying a portion of said second data to said first                                                       |
| - 6         | area.                                                                                                     |
|             |                                                                                                           |
| 1           | 19. The method of claim 17 wherein said step of                                                           |
| 2           | executing comprises                                                                                       |
| . 3         | overwriting a first portion of said first data with                                                       |
| 4           | a second portion of said first data;                                                                      |
| 5           | copying a portion of said second data to said first                                                       |
| 6           | memory.                                                                                                   |
| •           |                                                                                                           |
| 1           | 20. The method of claim 17, further comprising the                                                        |
| 2<br>3      | step of                                                                                                   |
| 3<br>4      | re-linking said copy of said second data in said                                                          |
| *           | first memory.                                                                                             |
| 1           | 21 2 2 2 2 2                                                                                              |
| 2           | 21. A system for managing the memory of a                                                                 |
| 3           | processor, said system comprising:                                                                        |
| 4           | said processor having said memory;                                                                        |
| - 5         | a first application in said memory;                                                                       |
| 6           | a second application in said memory, said second                                                          |
| 7           | application adapted to develop said first application,                                                    |
| 8           | said second application adapted to reserve free space                                                     |
| 9           | from said memory and requiring memory management services for said free space;                            |
| 10          | a communications link;                                                                                    |
| 11          |                                                                                                           |
| 12          | a second processor, coupled to said processor by means of said communications link, said second processor |
| 13          | having second memory; and                                                                                 |
| 14          | a third application in said second memory, said                                                           |
| 15          | third application adapted to provide said memory                                                          |
|             | ti adapted to provide Said memory                                                                         |

| 16 | management services for said free space over said         |
|----|-----------------------------------------------------------|
| 17 | communications link.                                      |
| 1  | 22. A method for managing the memory of a                 |
| 2  | processor, said method comprising:                        |
| 3  | coupling said processor to a second processor by          |
| 4  | means of a communications link, said second processor     |
| 5  | having second memory;                                     |
| 6  | running a first application in said memory;               |
| 7  | running a second applications in said memory which        |
| 8  | reserves free memory, said second application for         |
| 9  | developing said first application, said second            |
| 10 | application requiring memory management services for said |
| 11 | free memory;                                              |
| 12 | running a third application in said second memory,        |
| 13 | said third application adapted to provide memory          |
| 14 | management services over said communications link;        |
| 15 | requesting memory management services of said second      |
| 16 | processor, thereby generating instructions for said first |
| 17 | processor;                                                |
| 18 | managing the free memory of said first processor in       |
| 19 | response to said instructions.                            |
| 1  | 23. An apparatus for analyzing a                          |
| 2  | processor-controlled system, said apparatus comprising:   |
| 3  | an external device adapted to be controlled by a          |
| 4  | processor;                                                |
| 5  | a first processor having memory, said first               |
| 6  | processor adapted for coupling to said external device,   |
| 7  | said first processor having exactly the number of         |
| 8  | communications channels necessary for said first          |
| 9  | processor to operate in final configuration including     |
| 10 | running an application in said memory, thereby            |
| 11 | controlling said external device; and                     |
| 12 | a second processor having second memory, said second      |
| 13 | processor coupled to said first processor by means of one |
| 14 | of said number of communication channels and coupled to   |
| 15 | said external device by means of a communication channel, |

1 2

3

4

5

6 7

8

9

10

12

13

14 15

16

17

18

1

2 3

. 5

7

8

9

10

| 16  | said second processor running a second application in     |
|-----|-----------------------------------------------------------|
| .17 | said second memory, said second application adapted to    |
| 18  | develop said first application, including communicating   |
| 19  | data to said external device, said data controlling said  |
| 20  | external device, said data received over said one of said |
| 21  | number of communications channels and communicated over   |
| 22  | said communications channel.                              |

24. A method for analyzing a processor-controlled system, said method comprising:

coupling a first processor to a second processor by means of a first communications channel;

coupling an external device to said second processor by means of a second communications channel, said external device adapted to be controlled by said first processor by means of said first communications channel;

generating data to control said external device; communicating said data from said first processor to said second processor over said first communications channel;

receiving said data in said second processor and communicating said data via said second communications channel to said external device; and

sending second data to analyze said first processor between said first and second processors via said first communications channel.

- 25. An apparatus for analyzing a
  processor-controlled systems, said apparatus comprising:
  - a plurality of processor-controlled systems interconnected by means of a network;
  - a registry, coupled to said network, said registry adapted to register processor-controlled systems;
  - a processor, coupled to said network and registered with said registry, said processor having a user input device, adapted to receive a command on said user input device and adapted to establish a communication channel

2

test, said system comprising:

| 11 | with a one of said plurality of processor-controlled    |
|----|---------------------------------------------------------|
| 12 | systems specified in said command.                      |
|    |                                                         |
| 1  | 26. In a target/host computer system, an agent          |
| 2  | having a size less than 300 kilobytes (Kb).             |
| :  |                                                         |
| 1  | 27. The computer system of claim 26 wherein said        |
| 2  | agent has a size less than 200 Kb.                      |
|    |                                                         |
| 1  | 28. The computer system of claim 26 wherein said        |
| 2  | agent has a size less than 100 Kb.                      |
|    |                                                         |
| 1  | 29. The computer system of claim 26 wherein said        |
| 2  | debugging agent has a size of about 20 Kb.              |
|    |                                                         |
| 1  | 30. A method for debugging a software program under     |
| 2  | test, said method comprising the steps of:              |
| 3  | executing on a first processor development software     |
| 4  | having an interface;                                    |
| 5  | loading said software program under test in the         |
| 6  | memory of a second processor having a receptacle for    |
| 7  | receiving hardware capable of being read or written;    |
| 8  | executing software for examining said software          |
| 9  | program under test on said second processor;            |
| 10 | placing an emulator of said hardware in said            |
| 11 | receptacle; and                                         |
| 12 | communicating between said first and second             |
| 13 | processors by means of said emulator.                   |
|    |                                                         |
| 1  | 31. The method of claim 30 wherein said step of         |
| 2  | communicating comprises                                 |
| 3  | communicating between said first and second             |
| 4  | processors by means of said interface and said software |
| 5  | for examining said software program under test.         |
|    |                                                         |
| _  | an a further for debugging a coftwarp program under     |

| a first processor having a memory development software having an interfa a second processor having | ice;              |
|----------------------------------------------------------------------------------------------------|-------------------|
|                                                                                                    |                   |
| · T                                                                                                |                   |
| 6 a receptacle for receiv                                                                          | ing hardware      |
| 7 capable of being read or wri                                                                     |                   |
| a memory wherein is loc                                                                            |                   |
| 9 program under test and softw                                                                     |                   |
| said software program under                                                                        | test: and         |
| an emulator of said hardware, locations                                                            |                   |
| 12 receptacle and communicatively coupling                                                         | g said first and  |
| second processors.                                                                                 | y IIIbt and       |
| 1 33. The system of claim 32 where                                                                 | in said onulation |
| 2 comprises                                                                                        | in said emulator  |
| an emulator communicatively coupli                                                                 | ina said          |
| development software and first and said                                                            |                   |
| 5 examining.                                                                                       | . DOLEMATE TOT    |



SUBSTITUTE SHEET (RULE 28)



V

F/G.

SUBSTITUTE SHEET (RULE 26)



FIG. 3.
SUBSTITUTE SHEFT (RULE 28)





*F/G. 5.*SUBSTITUTE SHEET (RULE 28)



FIG. 6.

SUBSTITUTE SHEET (RULE 28)

THIS PAGE IS LANGER WESTERN



## WORLD INTELLECTUAL PROPERTY ORGANIZATION International Bureau



# INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

- (51) International Patent Classification 6: G06F 9/455, 9/40, 9/45, 11/30, 15/46
- (11) International Publication Number:

WO 98/09208

(43) International Publication Date:

5 March 1998 (05.03.98)

(21) International Application Number:

PCT/US97/14842

**A3** 

(22) International Filing Date:

21 August 1997 (21.08.97)

(30) Priority Data:

\_28 August 1996 (28.08.96) -08/705;881 --

- (71) Applicant: WIND RIVER SYSTEMS, INC. [US/US]; 1010 Atlantic Avenue, Alameda, CA 94501 (US).
- (72) Inventors: FOGELIN, John, C.; 250 Coleridge Street, San Francisco, CA 94110 (US). SMITH, Colin; 22, impasse de la Capitainerie, F-56860 Sene (FR). WILNER, David, N.; 2733 Claremont Boulevard, Berkeley, CA 94705 (US). HARTMAN, John; 4680 Davenport Avenue, Oakland, CA 94619 (US). LONG, Kent, D.; 6109 McBride Avenue, Richmond, CA 94805 (US). STEIMAN-CAMERON, Debbie; 30 Waterside Circle, Redwood City, CA 94065 (US). SHEPARD, Marc; 4584 Santa Rita Road, Richmond, CA 94803 (US). ZHOU, Yiwen; 2020 Franciscan Way #208, Alameda, CA 94501 (US). WADDINGTON, Simon; 3 Embarcadero West #341, Oakland, CA 94607 (US). MAISON-NEUVE, Philippe; 1, allée Louis Aubert, F-56000 Vannes (FR). DARLET, Pierre-Alain; 17, square les Bouleaux, F-56610 Arradon (FR). PREYSSLER, Thierry; 2615 Telegraph Avenue #203, Berkeley, CA 94704 (US). ORIOT, Jean-Claude; 3, rue de Ranquin, F-56860 Sene (FR).
- ents: MENDENHALL, Larry et al.; Townsend and Townsend and Crew LLP, 8th floor, Two Embarcadero Center, San Francisco, CA 94111 (US).
- (81) Designated States: JP, European patent (AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE).

#### Published

With international search report.

(88) Date of publication of the international search report: 30 July 1998 (30.07.98)

(54) Title: A TOOL FOR SOFTWARE DIAGNOSIS DIVIDED ACROSS MULTIPLE PROCESSORS



#### (57) Abstract

A target/host computer system wherein traffic on the connecting communications link and the size of the agent in the target are reduced by using Gophers, debugging services on demand, shared run-time libraries, quantized streaming differential caches, host-based target memory management, virtual I/O and in-circuit emulation. The figure illustrates a target computer (520) and a host computer (530), connected by means of remote communications technology (510), where ports onto a WAN (540A, 540B), telephone or satelite links, are available.

### FOR THE PURPOSES OF INFORMATION ONLY

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT.

|      |                          |      |                     | •     |                       |    |                                       |
|------|--------------------------|------|---------------------|-------|-----------------------|----|---------------------------------------|
| AL   | Albania                  | ES   | Spain               | LS    | Lesotho               | SI | Slovenia                              |
| AM   | Armenia                  | FI   | Finland             | LT    | Lithuania             | SK | Slovakia                              |
| AT   | Austria                  | FR   | France              | LU    | Luxembourg            | SN | Senegal                               |
| AU   | Australia                | - GA | Gabon               | LV    | Latvia                | SZ | Swaziland                             |
| ΑZ   | Azerbaijan               | GB   | United Kingdom      | MC    | Моласо                | TD | Chad                                  |
| BA   | Bosnia and Herzegovina   | GE   | Georgia             | MD .  | Republic of Moldova   | TG | Togo                                  |
| BB   | Barbados                 | GH   | Ghana               | MG    | Madagascar            | TJ | Tajikistan                            |
| BE   | Belgium                  | GN   | Guinea              | MK    | The former Yugoslav   | TM | Turkmenistan                          |
| BF   | Burkina Faso             | GR   | Greece              |       | Republic of Macedonia | TR | Turkey                                |
| BG   | Bulgaria                 | HU   | Hungary             | ML    | Mali                  | TT | Trinidad and Tobago                   |
| BJ - | Benin                    | IE   | Ireland             | MN    | Mongolia              | UA | Ukraine                               |
| BR   | Brazil                   | · IL | Israel              | MR    | Mauritania            | UG | Uganda                                |
| BY   | Belarus                  | IS ` | Iceland             | MW    | Malawi                | US | United States of America              |
| CA   | Canada                   | IT   | Italy               | MX    | Mexico                | UZ | Uzbekistan                            |
| CF   | Central African Republic | JP   | Japan               | NE    | Niger                 | VN | Viet Nam                              |
| CG · | Congo                    | KE   | Kenya 💉             | NL    | Netherlands           | YU | · · · · · · · · · · · · · · · · · · · |
| CH   | Switzerland              | KG   | Kyrgyzstan          | NO    | Norway                | zw | Yugoslavia                            |
| CI   | Côte d'Ivoire            | KP   | Democratic People's | NZ    | New Zealand           | ZW | Zimbabwe '                            |
| CM   | Cameroon                 |      | Republic of Korea   | PL    | Poland                |    |                                       |
| CN   | China                    | KR   | Republic of Korea   | PT    | Portugal              |    |                                       |
| CU   | Cuba                     | KZ   | Kazakstan           | RO    | Romania               |    |                                       |
| CZ · | Czech Republic           | LC   | Saint Lucia         | RU    | Russian Federation    |    |                                       |
| DE · | Germany                  | LI   | Liechtenstein       | SD    | Sudan                 |    |                                       |
| DK · | Denmark                  | LK   | Sri Lanka           | SE SE | Sweden                |    |                                       |
| EE   | Estonia                  | LR   | Liberia             | SG    | Singapore             |    |                                       |